Optimal Web-Scale Tiering as a Flow Problem
Abstract
We present a fast online solver for large-scale parametric max-flow problems as they occur in portfolio optimization, inventory management, computer vision, and logistics.
• Our algorithm solves an integer linear program in an online fashion.
• It exploits total unimodularity of the constraint matrix and a Lagrangian relaxation to solve the problem as a convex online game.
• The algorithm generates approximate solutions of max-flow problems by performing stochastic gradient descent on a set of flows.
• We apply the algorithm to optimize the tier arrangement of over 80 million web pages on a layered set of caches to serve an incoming query stream optimally.

Motivating Example: The Tiering Problem

Goal:
• Select documents to be stored in successive tiers or caches of decreasing access frequency,
• such that frequently accessed documents are found in the highest tiers,
• so that the search engine can cover incoming queries with low latency and computational load.

One proposed solution:
• Assign a value to each document and arrange them such that the highest-valued documents reside in the highest levels of the cache.
• But this is sub-optimal. Reason: to answer a given query well, a search engine returns not only a single document but a list of r (typically r = 10) documents.

[Figure: documents doc 1 to doc 10 and cache 1]
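The sub-optimality of ranking documents one by one can be seen on a toy instance. The sketch below is illustrative only: the workload, the capacity, and the function names are hypothetical, and it assumes that a query is served from cache 1 only when every document in its result list is stored there, which the abstract suggests but does not state. Brute-force enumeration stands in for an exact solver here; it is not the online algorithm described above.

```python
from itertools import combinations

# Hypothetical toy workload: query -> (documents in its result list, frequency).
QUERIES = {
    "q1": ({"a", "b"}, 5),
    "q2": ({"c", "d"}, 4),
    "q3": ({"c", "e"}, 4),
    "q4": ({"a", "f"}, 1),
}
CAPACITY = 2  # assumed number of documents that fit in cache 1


def covered_traffic(cache):
    """Total frequency of queries whose whole result list is in the cache."""
    return sum(freq for docs, freq in QUERIES.values() if docs <= cache)


def rank_by_document_value():
    """Per-document heuristic: score each document by the traffic that
    touches it and keep the top CAPACITY documents."""
    value = {}
    for docs, freq in QUERIES.values():
        for doc in docs:
            value[doc] = value.get(doc, 0) + freq
    ranked = sorted(value, key=lambda doc: (-value[doc], doc))
    return set(ranked[:CAPACITY])


def best_by_enumeration():
    """Exact answer for this tiny instance by trying every cache content."""
    all_docs = sorted(set().union(*(docs for docs, _ in QUERIES.values())))
    candidates = (set(c) for c in combinations(all_docs, CAPACITY))
    return max(candidates, key=covered_traffic)


if __name__ == "__main__":
    heuristic = rank_by_document_value()
    optimal = best_by_enumeration()
    print(sorted(heuristic), covered_traffic(heuristic))
    print(sorted(optimal), covered_traffic(optimal))
```

In this instance the heuristic caches the two individually most valuable documents, which never appear together in any result list, so no query is fully served from cache 1, while caching the pair needed by the most frequent query covers that query entirely.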